Path Normalcy Analysis Using Nearest Neighbor Outlier Detection
نویسندگان
چکیده
We present a machine learning technique that recognizes patterns of normal movement, using GPS data and time stamps, to gain the ability to detect regions of time containing abnormal movement. We argue people move throughout regions of time in established patterns, and a person’s normal movement can be learned by machines. We use intelligent features extracted from raw GPS data with time stamps, to describe a person’s movement over discrete regions of time. Then we use a nearest neighbor approach to determine outliers in a distribution of time regions. We consider outliers as time regions where patterns of established normal movement have been violated. Ultimately, we produce a distance range value for a distribution in conjunction with normalized scores depicting the degree to which each time region contained movement consistent with the other time regions being analyzed. We also produce a classification of each day as normal or abnormal.
منابع مشابه
Spatio-Temporal Outlier Detection Technique
Outlier detection is very important functionality of data mining, it has enormous applications. This paper proposes a clustering based approach for outlier detection using spatio-temporal data. It uses three step approach to detect spatiotemporal outliers. In the first step of outlier detection, clustering is performed on the spatio-temporal dataset with proposed Spatio-Temporal Shared Nearest ...
متن کاملOutlier Detection on High Dimensional Data Using RNN
Background: Outlier detection is an important factor in data mining since it is used in various real time applications. Outlier is an extreme points that are not related to any of the class. Dealing with dimensions is the great challenge, due to “curse of dimensionality”, for effective outlier detection. In a high dimensional data space, it is difficult to detect most related points and most un...
متن کاملRapid Distance-Based Outlier Detection via Sampling
Distance-based approaches to outlier detection are popular in data mining, as they do not require to model the underlying probability distribution, which is particularly challenging for high-dimensional data. We present an empirical comparison of various approaches to distance-based outlier detection across a large number of datasets. We report the surprising observation that a simple, sampling...
متن کاملFast and Scalable Outlier Detection with Approximate Nearest Neighbor Ensembles
Popular outlier detection methods require the pairwise comparison of objects to compute the nearest neighbors. This inherently quadratic problem is not scalable to large data sets, making multidimensional outlier detection for big data still an open challenge. Existing approximate neighbor search methods are designed to preserve distances as well as possible. In this article, we present a highl...
متن کاملNearest Neighbour Based Outlier Detection Techniques
Outlier detection is an important research area forming part of many application domains. Specific application domains call for specific detection techniques, while the more generic ones can be applied in a large number of scenarios with good results. This survey tries to provide a structured and comprehensive overview of the research on Nearest Neighbor Based Outlier Detection listing out vari...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008